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The Barabasi-Albert (BA) model is extended to include the concept of local world and 
the microscopic event of adding edges. With probability p, we add a new node with m 
edges which preferentially link to the nodes presented in the network; with probability 
1 — p, we add m edges among the present nodes. A node is preferentially selected by its 
degree to add an edge randomly among its neighbors. Using continuum theory and rate 
equation method we get the analytical expressions of the power-law degree distribution 
with exponent 7 = 3 and the clustering coefficient c(fc) ~ + c. The analytical 

expressions are in good agreement with the numerical calculations. 
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1. Introduction 

Various complex systems both in nature and in society, such as the Internet, 1 cells 
2 ' 3,4 and social networks can all be described by networks. Despite their diversi- 
ties, most networks appearing in nature follow universal organizing principles. In 
particular, it has been found that the degree distributions of most networks decay 
as a power law, following p(k) ~ fc~ 7 , where 7 is the degree exponent varied be- 
tween 2 and 3, and the average shortest path length increases with the system size 
logarithmically. 5 ' 6,7 Especially for social networks, which show high clustering for 
their high transitivity. That is, if person A knows person B and C, then B and C are 
likely to know each other. The previous studies on social networks have originated 
from Rapport's studies of disease spreading. 8 To understand the effect of network 
topology on the pattern of disease spread, one must first understand its topology 
and evolving mechanisms. 

The small-world network model (typical WS model) proposed by Watts and 
Strogatz has both the properties of high clustering and a short path length, while 
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the degree distribution is similar to Poisson distribution, not coincident with the 
real- world network of power-law distribution. 9 The origin of scale-free properties is 
well understood in terms of interaction that generates this topology dynamically by 
the Barabasi- Albert model, e.g., on the basis of network growth and preferentially 
attachment. 10 Although the model captures the power-law tail of the degree distri- 
bution, it has other properties that may not agree with the empirical results of the 
real networks, such as the property of high clustering. In fact, it may be impossible 
to construct an universal model to describe all kinds of networks well. 

A great number of attempts have been made to construct a specific model with 
some properties to coincide with some special networks. 11,12 ' 13,14 ' 15,16,17 Davidsen 
et al constructed a small world network with local interaction to model acquaintance 
networks. The model is formulated with fixed number of N nodes. The local connec- 
tion rule is based on "transitive linking" . By simulation, he got a scale- free network 
with parameter of probability p and the network had the small- world character. 18 
Holme et al constructed an extended model of scale-free network to include a "triad 
formation step". They also obtained a scale-free network with tunable clustering 
and a short path length. 19 Then Szabo et al gave the analytical expression of the 
clustering coefficient varied with degree k for this model. 20 In this paper, we propose 
another extended model to make an attempt to explore the evolving mechanisms 
of social networks. 

There are many local communities existing in social networks according to dif- 
ferent interests or ages, etc. For example, in a collaboration network of scientists at 
the Santa Fe Institute, scientists are classified into different communities according 
to their similarity either of research topic or methodology. The communities include 
scientists whose research are on the structure of RNA, statistical physics and math- 
ematical ecology. 21 Another character of the social network is its high transitivity. 
It is more reasonable to consider that the center person is more likely to make his 
neighbors acquainted by introducing them to each other than a person selected ran- 
domly. Based on these two assumptions we propose an evolving model of scale-free 
network with tunable clustering. 

The rest of the paper is organized as follows: Sec. 2 provides a brief review of 
the Barabasi-Albert scale-free network generation algorithm, then our model and 
algorithm is introduced, followed by analytical calculations with continuum theory 
and rate equations method in Sec. 3. Finally, some conclusions are given in Sec. 4. 

2. Models 

2.1. The Barabasi- Albert scale-free model 

The BA model is briefly reviewed first and then our algorithm is proposed. The BA 
model is defined as follows 10 : 

(i) Initial condition: The network consists of m n nodes and no edges. 

(ii) Growth : At each time step, add a new node with m(m < Too) edges. 

(Hi) Preferential attachment: Each edge of the new node is attached to the exist- 
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ing node i. The probability J| that a new node will be connected to node i depends 
on the degree h of node i, such that II i s proportional to its deg rcG ki^ tli&t is; 

n<*> - & 

3 

After t time steps, the system develops to be a network with TV = m a +t nodes. 
With continuum theory, Barabasi and Albert calculated the degree distribution p(k) 
analytically. The probability p(k) that a randomly chosen node has k links decays 
as a power law, p(k) ~ ^p-. 



2.2. Our Model 

The BA model can be extended with the concept of local world and the microscopic 
event of adding edges. The local-world network was proposed by Li, 14 which is based 
on the appearance of local world in a superfamily of protein in protein interaction 
network, human community etc. So the local world concept is also introduced in 
our model. The model is generated by the following algorithm: 

(i)Initial condition: Start with tuq nodes and eo edges, and select M nodes 
randomly from the present network, referred to as the local community of the new 
coming node. 

(n)With probability p, add a new node with m edges. The new node attaches 
to an existing node i of the local community with the probability II (^»); which is 
proportional to its degree. 

= S local community) ■ — — (1) 

2-^j^localcomvaunity j 

wheref|(i S local community) = % = m ^+ pt ~ • 

(iw)With probability 1 — p, add m edges. A node is selected according to the 
preferential attachment rule and then an edge is randomly added among its neigh- 
bors. Just as in social networks, the more famous a person is, the more likely he 
is to pick two persons of his neighbors randomly and introduce them to know each 
other. 

These steps are then iterated. After t time steps, the network develops to be 
N = mo + pt nodes and 2mt edges, then the average degree of the network is: 

2mt 2m 

< k >= w — 2 

m +pt p 

3. Analytical Results 

The continuum theory and rate equations method are used to calculate the degree 
distribution p(k) and the clustering coefficient c(k) for various degree k. According 



2, 2008 16:55 WSPC/INSTRUCTION FILE new 



4 



to continuum theory, fcj is supposed to change continually. The probability n(^i) 
is the rate at which ki changes. For node i of degree fc,, ki changes as 

— pm J^J(fcj) + (l— p)m £ local community) — — • y— (3) 

Z-/ j £localcommunity 3 n 

where f2 is the neighbors set of node i. The first term in the sum corresponds to 
node i selected preferentially to link to the new node in step (ii); the second term 
is own to step (Hi). Node i is one of the neighbors of preferentially selected node 
n, then node i is randomly selected to add an edge to increase the degree of node 
i. k n denotes the degree of the neighboring node n of node i. 
Using(2), we can obtain: 

^1 = pm Y\(i local community) 



dt ^ ll v V k- 

j tlocalcommunity 3 

ki • k n 1 
k~ ' ~k~ 

j^localcornrnunity 3 n 



+ (1 — p)m ■ J^J(j G local community) 



M k, M k. 

' P m ' —r ■ tj — ~t~ + (1 - P)m ■ — 



pt M- < k> ' pt M- < k> 

M k, 



m 



pt M- < k> 
~ 2t 

So the change rate of degree h is given by: 

Oki ki 
~dt~2t 

then 

t 



(4) 



(5) 



h{t) » m( F ) 1 / 2 (6) 

With continuum theory we can also obtain the same degree distribution as that of 
the BA model. The degree distribution follows the same power law, p(k) ~ -p-. 

Fig.l shows that the analytical calculation is in good agreement with numerical 
result. 

To calculate the clustering coefficient of node i, the average rate of change for ej 
must be considered, where e, is the number of connected neighbors of node i. Then 
by using Ci(h) = ki (k--i) , w e can get Ci(ki) for node i of degree ki. 

2 

For node i, its connected neighbors e« changes for two cases as shown in Fig. 2. 
Case I shows the step (ii) to add a new node. The new node coming into system 
preferentially attaches to node i and one of its neighbors to form a triangle; Case 
II shows the step (Hi) to add an edge. Node i is preferentially selected to pick two 
of its neighbors randomly to link. This process also increases the number of e,, so 
the rate equation for e, is as follows: 



February 2, 2008 16:55 WSPC/INSTRUCTION FILE new 



5 




2 25 



Fig. 1. Degree distribution p(k) in log-log scale with N = 10000, mo = 25, M = 20, m = 
3, p = 0.97. Star-points represent numerical simulation; Solid line is an analytical expression with 




i n 



Fig. 2. Two possible cases to increase the number of connected neighbors e; of node i. Case I: 
The new node preferentially attaches to node i and one of its neighbors in step [it). Case II: Two 
neighbors of node i are randomly selected to add an edge in step (Hi). 



I : — — = pm TT(i £ local community)— — 

2-~d j £localcommunity 3 

U 

■{pm — 1) £ local community) ■ — (7) 

2-~d j £localcommunity) 3 

where f2 is still the neighbors set of node i. The right side of Eq.(7) corresponds to 
case I in Fig. 2. The new node preferentially links to node i and one of its neighbors. 



I 



I 
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Eq.(7) is equivalent to the following: 

T de t M ki , . M ki- < k n > 

1 '■ -TT- — P m ' ' ; 7T ' (P m — 1 ) ' ' ~TT ; 

dt m a +pt <k>-M m +pt M-<k> 

ui • ki 1 ki • < /c n >■ 
f« — • pm - 1) • — • : 

<k>t pt <k> 

m(pm — 1) kf 



p t 2 - < k > 2 

m{pm — 1) fc 2 p < k > 



p t 2 - < k > 2 4 

m(pm — 1) /c 2 



* -InN 



4 t 2 - <k> 

p(pm - 1) fc? 



inJV 



- 8 f lnN (8) 

In Eq.(8), the average degree of the first neighbor of node i is denoted by < k n >. 
For uncorrected random BA model, < k n >= p, < fc> • InN, see reference 22 . 
For part II, we have: 

»:| = (1"J)'* ( 9 ) 

II is the process of adding edges in Fig. 2 case II. Node i is preferentially selected to 
add an edge among its neighbors randomly to increase e\. 
Integrating Eq.(8)and Eq.(9), respectively, we obtain: 

J:e)= /'Pfem-l).a. in(pt)d( 



P{pm - l) mH ln{pt)dt 



i 8 t 2 ■ U 

jf p: pm - I) III 2 



8U t 

~p p(pm — 1) • m 2 
8^ 

p(pm — 1) • m 
16*7 

p 2 (pm-l) (InN) 2 
16 



ln(pt)dt 
ln{pt)d{ln{pt)) 



2 



(InN) 



2 



(10) 



11:* = J (1 — p)mJ^J(i e local community) 



k 

j j tlocalcommunity J 



■dt 



(1 — p)m ■ — • — : — dt 

K 1 ' pt M- <k> 
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2 t 



-d-P)/ T t* 

*(l-p)(k,(^)-eo) (ID 

By summing 7 and 77, the number of connected neighbors of node i is ob- 
tained: 

p 2 (pm-\) (InN) 2 , 2 ,„ w , ,AT, , , „, 

= e*o + '- ■ ■ kf + (1 - p)(fci(-) - e ) (12) 

with Cj = kjikj-i) ) the general expression of the clustering coefficient Ci(fci) has 

2 

the form 

2(1 -p) (pm - l)p 2 (InAQ 2 
~ fc, + 8 JV [6) 

This implies that 3/c* satisfied that: 

(pm - l)p 2 ■ (InN) 2 { ' 

Then 

„ _ (pm - l)p 2 (lnN) 2 



(15) 



So the general solution of the clustering coefficient has the form c(k) = 2 ^ 1 r p - ) + 
^- . The change rate of the clustering coefficient c(k) can be tuned by parameter p. 
Fig. 3 displays the results of simulation and analytical prediction. 



4. Conclusions 

In this paper a developing network with adjustable parameter was proposed to 
control the change of the clustering coefficient. The assumption of this model can 
be justified by the appearance of local communities and high transitivity in social 
networks. With continuum theory and rate equation method, the analytical expres- 
sions of the degree distribution and the clustering coefficient have been derived. The 
analytical predictions are in good agreement with the numerical results. However, 
we just made a first step to explore and understand the evolving mechanisms of 
social networks. Many other important factors need to be considered in the future, 
such as the left of some persons periodically. Also for simplicity we just selected the 
local world randomly from the network. 
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Fig. 3. The clustering coefficient c vs degree k with N = 10000, mo = 25, M = 20, m = 3,p = 

0. 97. Simulation results (o); Least-square-fit curve (.); Prediction curve (-). c(fe) = 2^2£ + 0.004. 
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